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Speech perception is an inherently multi- 
sensory process. When having a face-to- 
face conversation, a listener not only hears 
what a speaker is saying, but also sees 
the articulatory gestures that accompany 
those sounds. Speech signals in visual and 
auditory modalities provide complemen- 
tary information to the listener (Kavanagh 
and Mattingly, 1974), and when both are 
perceived in unison, behavioral gains in 
in speech perception are observed (Sumby 
and Pollack, 1954). Notably, this benefit is 
accentuated when speech is perceived in 
a noisy environment (Sumby and Pollack, 
1954). To achieve a behavioral gain from 
multisensory processing of speech, how- 
ever, the auditory and visual signals must 
be perceptually bound into a single, uni- 
fied percept. The most commonly cited 
effect that demonstrates perceptual bind- 
ing in audiovisual speech perception is the 
McGurk effect (McGurk and MacDonald, 
1976), where a listener hears a speaker 
utter the syllable "ba," and sees the speaker 
utter the syllable "ga." When these two 
speech signals are perceptually bound, the 
listener perceives the speaker as having said 
"da" or "tha," syllables that are not con- 
tained in either of the unisensory signals, 
resulting in a perceptual binding, or inte- 
gration, of the speech signals (Calvert and 
Thesen, 2004). 

The ability to perceptually bind sen- 
sory information is notably impaired in 



a number of clinical populations, includ- 
ing those with autism spectrum disor- 
ders (ASD). ASD describes a cluster of 
highly prevalent developmental disabili- 
ties historically characterized by deficits in 
three functional domains: language and 
communication, social reciprocity, and the 
presence of restricted interests/repetitive 
behaviors (APA, 2000). Since its initial 
description, alterations in sensory process- 
ing have been described in this popu- 
lation (Kanner, 1943), yet these deficits 
were acknowledged only in the most 
recent edition of the DSM (APA, 2013). 
Impairments in multisensory perceptual 
binding may be particularly relevant in 
ASD, given that hallmark features of the 
disorder include difficulties in speech, 
communication, and social interactions. 
Successful speech communication is heav- 
ily reliant on binding across sensory 
modalities, and as such, impaired binding 
in individuals with ASD likely contributes 
to these core deficits. 

Impairments in perceptual binding 
have not gone unstudied in ASD. In fact, 
one of the leading theories describing ASD, 
Weak Central Coherence, describes ASD as 
a cognitive style in which focus is selec- 
tively attuned to individual components of 
information to the exclusion of perceiv- 
ing the larger whole; in short, losing the 
proverbial forest for the trees (Frith and 
Happe, 1994; Happe, 1999, 2005; Happe 



and Frith, 2006). Evidence for this has 
been found across a wide range of tasks. 
For example, individuals with ASD bene- 
fit less than individuals without ASD from 
context when interpreting a sentence or 
story (Happe, 1994; Jolliffe and Baron- 
Cohen, 1999), but are more accurate than 
individuals without ASD when focusing 
on explicit local details of a passage (Noens 
and Berckelaer-Onnes, 2005). 

In the realm of sensory perception, 
binding deficits in ASD have been stud- 
ied most extensively in the visual modality. 
Here too, individuals with ASD have been 
shown to have a strong local bias at the 
expense of global processing (Behrmann 
et al., 2006). A clear example of this is 
observed in response to hierarchical let- 
ters (large letters composed of smaller let- 
ters; Navon, 1977). When performing a 
task reliant upon the identify the gestalt 
of the image (the large letter) relative to 
the individual units (small component let- 
ters), individuals with ASD show impaired 
performance (Behrmann et al., 2006). 

The ability of individuals with ASD to 
bind across sensory modalities has been 
studied to a much lesser extent, but those 
studies that have been conducted com- 
monly find deficits in multisensory per- 
ceptual binding, particularly with speech 
signals. The majority of the research sug- 
gests that individuals with ASD perceive 
the McGurk illusion less often than their 
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peers without ASD (de Gelder et al., 1991; 
Williams et al, 2004; Mongillo et al, 2008; 
Irwin et al, 2011; Bebko et al, 2014; 
Stevenson et al., 2014, in press; but see 
Iarocci and McDonald, 2006; Woynaroski 
et al, 2013), often relying instead on the 
auditory modality to the exclusion of the 
visual information (Mongillo et al, 2008; 
Stevenson et al, 2014, in press). While 
individuals with ASD may be able to per- 
ceptually bind information under optimal 
conditions, these results imply that indi- 
viduals with ASD show reduced efficiency 
when binding speech information across 
auditory and visual modalities, particu- 
larly in noisy, real-world contexts (Foxe 
et al., 2013). As a consequence, signals are 
perceived in isolation, or as fragmented 
units rather than as a meaningful whole. 
Thus, the efficiency gained from process- 
ing multiple sensory signals as a single 
percept, for example the visual sensory 
inputs associated with a speaker integrated 
with the auditory sensory inputs associ- 
ated with a speaker (Stevenson et al, 2010, 
2011), would be lost, resulting in more 
inefficient sensory processing overall. 

Given the findings that individuals with 
ASD show reduced perceptual binding of 
audiovisual speech signals, it has been 
hypothesized that individuals with ASD 
would not exhibit the behavioral gains 
observed with the perception of multi- 
sensory signals. The few studies to date 
that have investigated multisensory per- 
ception of audiovisual speech have shown 
that children with ASD do in fact show less 
behavioral gain (i.e., less improved per- 
ception) with audiovisual speech than do 
their typically developing peers (Alcantara 
et al, 2004; Smith and Bennetto, 2007; 
Irwin et al, 2011; Foxe et al., 2013). This 
finding is especially salient when speech is 
embedded in a high degree of background 
noise (Foxe et al., 2013), the very con- 
dition in which (A) typically developing 
children show a high level of multisensory 
gain and (B), this multisensory integra- 
tion would be most beneficial for success- 
ful speech communication. The validity 
of the relationship between multisensory 
perception and real-world communication 
has been demonstrated via correlations 
between the accurate perception of audio- 
visual speech and communication scores 
from the Autism Diagnostic Observation 
Schedule (Lord et al, 2000), the gold 



standard for diagnostic testing in ASD. 
Individuals who were better able to accu- 
rately perceive audiovisual speech were less 
impaired in terms of communicative abili- 
ties (Woynaroski et al, 2013). 

Interestingly, multisensory speech inte- 
gration is not a static process, but one that 
continues to mature and fine tune over 
development (Hillock et al., 201 1; Hillock- 
Dunn and Wallace, 2012). While young 
children with ASD are clearly delayed in 
their ability to benefit from multisensory 
speech perception compared to their typ- 
ically developing peers, there is evidence 
that this impairment lessens with matura- 
tion (Foxe et al., 2013). Likewise, the first 
study of the McGurk Effect across devel- 
opment showed a similar pattern, in which 
young children with ASD perceived the 
McGurk Effect much less frequently than 
their peers without ASD, but "caught up" 
later in development (Taylor et al., 2010; 
but see Stevenson et al., in press). 

A critical question then, is what is 
the underlying cause of these disruptions 
in speech perception observed in ASD? 
One possibility is that individuals with 
ASD have impaired temporal processing 
abilities. One neurobiological account of 
ASD, the temporal binding hypothesis of 
autism (Brock et al., 2002) proposes just 
that. In terms of binding across sensory 
inputs, perceiving the timing of incom- 
ing sensory information is paramount to 
the ability to perceptually bind stimuli 
across sensory modalities. The temporal 
synchrony of such inputs is one, if not the 
most, salient cue that two inputs should 
be bound (Vroomen and Keetels, 2010). 
Previous research shows a clear pattern 
that individuals with ASD are significantly 
impaired in judging the relative timing of 
auditory and visual speech signals (Bebko 
et al, 2006; Foss-Feig et al, 2010; Kwakye 
et al., 2011; de Boer-Schellekens et al., 
2013; Woynaroski et al., 2013; Stevenson 
et al., 2014), and importantly, this research 
also showed a direct correlation between 
multisensory temporal acuity and the abil- 
ity to perceptually bind audiovisual speech 
signals in individuals with ASD (Stevenson 
et al, 2014). 

These findings, taken in sum, suggest 
that deficits in binding across auditory and 
visual modalities in ASD may have a cas- 
cading impact on speech perception and 
social processing, key clinical symptoms 



defining ASD. In most social communica- 
tive interactions, failing to perceive the 
auditory and visual components of the 
environment can result in missing criti- 
cal social cues, not to mention the con- 
tent of the message being conveyed. Failing 
to perceive a speaker's message as a sin- 
gle, unified percept, essentially doubles 
the number of perceived inputs, result- 
ing in an increasingly "noisy" or "intense" 
world — as is often described in the case of 
autism (Just et al., 2004; Markram et al., 
2007; Rippon et al., 2007; Pouget et al, 
2009). 

The impact of an inability to percep- 
tually bind across senses on other aspects 
of cognition has been well characterized 
in a patient with bilateral parietal hypop- 
erfusion (Hamilton et al., 2006). This 
patient, AWF, began to perceive what he 
heard and what he saw as being out of 
sync. As a result of this atypical mul- 
tisensory temporal processing, AWF was 
unable to perceptually bind audiovisual 
speech, indexed by an inability to perceive 
the McGurk Effect. Additionally, AWF 
no longer showed the typical behavioral 
benefits with he was shown a speakers 
mouth and articulatory gestures accompa- 
nying auditory speech. While the etiology 
of AWF's impairment is clearly distinct 
from ASD, the parallels in the percep- 
tion of audiovisual speech are striking. 
Furthermore, AWF's describes coping with 
his asynchronous environment by limit- 
ing face-to-face conversations and looking 
away from the face during in-person con- 
versations, both behaviors commonly seen 
in ASD. Such a coping strategy may reflect 
the perceived avoidance of social interac- 
tions in ASD, which may relate more to 
limiting the amount of perceptual noise in 
the environment. A similar argument has 
been made for self-stimulation or "stim- 
ming" behaviors commonly observed in 
ASD. It is possible that these repetitive 
movements provide a predictable and con- 
trolled sensory experience in an otherwise 
chaotic world (Jones et al, 2003). 

While the impact that atypical sen- 
sory binding appears to have on the core 
symptoms associated with ASD is sup- 
ported by research, the issue of how to 
translate these findings into clinical prac- 
tice has been largely unexplored (note 
here that treatments commonly referred 
to as "sensory integration therapy" do 
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not in fact focus on binding or integrat- 
ing information across sensory modal- 
ities). Intensive Behavioral Intervention 
(IBI) is the evidence-based treatment of 
choice for ASD; however, the degree of 
gain made by any one child is difficult 
to predict. While milder autism severity, 
higher adaptive functioning, and higher 
cognitive skills are related to better out- 
comes, there remain unaccounted for fac- 
tors which may predict which children 
benefit most from treatment (Flanagan 
et al., 2012). Given that sensory and mul- 
tisensory processing are foundational to 
the higher-level cognitive, communicative, 
and social functioning that treatments aim 
to address, knowledge of an individual's 
ability to process sensory information is a 
critical and necessary first step to benefit 
maximally from intensive intervention. 

These possible clinical implications are, 
at this stage, highly speculative. The pos- 
sible upsides, however, of moving this 
research from the laboratory into real- 
world settings are significant. A clear 
consensus of evidence suggests that indi- 
viduals with ASD process and integrate 
sensory information in an atypical man- 
ner, and that this is strongly linked to 
core impairments in communicative and 
social abilities. A number of research 
questions must be addressed in order to 
explore these possibilities. First, longitudi- 
nal studies of individuals with ASD need 
to be conducted to directly assess how 
speech and communication skills develop 
in conjunction with sensory processing, 
specifically binding across sensory modali- 
ties and multisensory temporal processing. 
Second, the mediating or moderating 
effect that specific sensory-processing phe- 
notypes in ASD have on the efficacy of 
evidence-based treatments such as IBI is 
sorely needed (in addition to other vari- 
ables such as IQ and gender; Wolery 
and Garfinkle, 2002; Rogers and Vismara, 
2008). Finally, research should ultimately 
go beyond documenting the sensory and 
multisensory processing abilities of indi- 
viduals with ASD and in addition, should 
also reveal how these abilities can be 
dynamically modulated. Plasticity within 
the relevant perceptual systems has been 
amply demonstrated (Fujisaki et al., 2004; 
Powers et al, 2009; Stevenson et al, 2013; 
Schlessinger et al., in press), but these 
findings have been not yet been applied to 



populations with ASD. Pursuing these and 
related studies has the potential to not only 
add to our understanding of ASD, but also, 
through clinical application, to improve 
the quality of life of individuals with ASD. 
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